Accelerating Best Response Calculation in Large Extensive Games

نویسندگان

  • Michael Johanson
  • Kevin Waugh
  • Michael H. Bowling
  • Martin Zinkevich
چکیده

ion Size (# information sets) A X Y To determine our payoff at A, we need to know the distribution over the opponent being in X and Y. Recursive tree walk algorithm: PASS FORWARDS: An array of probabilities of the opponent being in each of their information sets (X and Y) RETURN: Our value at our information set, given the opponent distribution. Only visits each game state once. But in big domains (1018 in our game) this is intractable. REACH: X: 0.9 Y: 0.25 VALUE: A: $0.25 Four steps for accelerating best response computation in imperfect information games

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Government and Central Bank Interaction under Uncertainty: A Differential Games Approach

Abstract Today, debt stabilization in an uncertain environment is an important issue. In particular, the question how fiscal and monetary authorities should deal with this uncertainty is of much importance. Especially for some developing countries such as Iran, in which on average 60 percent of government revenues comes from oil, and consequently uncertainty about oil prices has a large effec...

متن کامل

Convergence of best-response dynamics in extensive-form games

We prove that, in all finite generic extensive-form games of perfect information, a continuous-time best response dynamic always converges to a Nash equilibrium component. We show the robustness of convergence by an approximate best response dynamic: whatever the initial state and an allowed approximate best response dynamic, the state is close to the set of Nash equilibria most of the time. In...

متن کامل

Using Response Functions to Measure Strategy Strength

Extensive-form games are a powerful tool for representing complex multi-agent interactions. Nash equilibrium strategies are commonly used as a solution concept for extensive-form games, but many games are too large for the computation of Nash equilibria to be tractable. In these large games, exploitability has traditionally been used to measure deviation from Nash equilibrium, and thus strategi...

متن کامل

An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information

Developing scalable solution algorithms is one of the central problems in computational game theory. We present an iterative algorithm for computing an exact Nash equilibrium for two-player zero-sum extensive-form games with imperfect information. Our approach combines two key elements: (1) the compact sequence-form representation of extensiveform games and (2) the algorithmic framework of doub...

متن کامل

Best response equivalence

Two games are best-response equivalent if they have the same best-response correspondence. We provide a characterization of when two games are best-response equivalent. The characterizations exploit a dual relationship between payoff differences and beliefs. Some “potential game” arguments [Games Econ. Behav. 14 (1996) 124] rely only on the property that potential games are best-response equiva...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011